The Next Generation’s Personal File System Management

نویسندگان

  • Iru Wang
  • Xuan Yang
  • Yi-Chen Tsai
چکیده

The current file systems are hierarchical, which can cause duplicate storage and cannot represent human’s mind map. In this paper, we explore the possibility of a heuristic, relational personal file system. Regarding each file as a node in the graph, we implement K-means, EM, LDA and Tree Bagging algorithms respectively to group the related files. In this way, we convert the current hierarchical file system to relational file system. We compare the results of these algorithms, the error of K-means, EM, LDA and random forest are 62.33%, 61.4%, 42.33% and 16% respectively. Among all the unsupervised learning, LDA a popular and generative model in topic modeling gives the best accuracy, but still does not surpass supervised learning. Therefore we propose to combine LDA and Tree Bagging algorithm, using the semi-supervised learning to classify the files in the future. In the end, we also discussed the potential of combining latent fator model with above methods to classify a large scale of file sets, thus extending our method from personal file system to corporate file system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

XML Security in Certificate Management – XML Certificator

The trend of rapid growing use of XML format in data/document management system reveals that security measures should be urgently considered into next generation’s data/document systems. This paper presents a new certificate management system developed on the basis of XML security mechanisms. The system is supported by the theories of XML security as well as Object oriented technology and datab...

متن کامل

Outlook for the Next Generation’s Precision Forestry in Finland

During the past decade in forest mapping and monitoring applications, the ability to acquire spatially accurate, 3D remote-sensing information by means of laser scanning, digital stereo imagery and radar imagery has been a major turning point. These 3D data sets that use singleor multi-temporal point clouds enable a wide range of applications when combined with other geoinformation and logging ...

متن کامل

XML Security in Certificate Management

Thetrend of rapid growing use of XML format in data/document management system reveals that security measures should be urgently considered into next generation’s data/document systems. This paper presents a new certificate management system developed on the basis of XML security mechanisms. The system is supported by the theories of XML security as well as Object oriented technology and databa...

متن کامل

A qualitative study on personal information management (PIM) in clinical and basic sciences faculty members of a medical university in Iran

  Background: Personal Information Management (PIM) refers to the tools and activities to save and retrieve personal information for future uses. This study examined the PIM activities of faculty members of Iran University of Medical Sciences (IUMS) regarding their preferred PIM tools and four aspects of acquiring, organizing, storing and retrieving personal information.   Methods : The qualita...

متن کامل

Requirements for a Next Generation Personal File Manager

Scientists, engineers, knowledge workers and others need help managing their personal data files and the programs that manipulate their data. The current generation of software for supporting their needs, which we call Personal File Managers (PFMs), is not adequate. We propose five requirements that a next generation PFM should satisfy. We have created a mockup of a PFM which satisfies these re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013